Offline Imitation Learning with a Misspecified Simulator

Neural Information Processing Systems

In real-world decision-making tasks, learning an optimal policy without trial and error is an appealing challenge. When expert demonstrations are available, imitation learning that mimics expert actions can learn a good policy efficiently. Learning in a simulator is another commonly adopted approach to avoid real-world trial and error. However, neither sufficient expert demonstrations nor high-fidelity simulators are easy to obtain. In this work, we investigate policy learning under the condition of a few expert demonstrations and a simulator with misspecified dynamics. Under the mild assumption that local states remain partially aligned despite a dynamics mismatch, we propose imitation learning with horizon-adaptive inverse dynamics (HIDIL), which matches simulator states with expert states over an $H$-step horizon and accurately recovers actions with inverse dynamics policies. In the real environment, HIDIL can effectively derive adapted actions from the matched states. Experiments are conducted in four MuJoCo locomotion environments with modified friction, gravity, and density configurations. The results show that HIDIL achieves significant improvements in performance and stability in all of the real environments, compared with imitation learning methods and transfer methods from reinforcement learning.
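The core mechanism the abstract describes, recovering an action from a pair of states via an inverse dynamics model, can be illustrated with a minimal sketch. This is not the paper's implementation: HIDIL uses learned neural inverse dynamics policies and an $H$-step matching horizon, whereas the toy model below (class name `InverseDynamicsPolicy` and the linear least-squares fit are both illustrative assumptions) conditions on a single state pair and fits a linear map from transition data.

```python
import numpy as np

class InverseDynamicsPolicy:
    """Toy inverse dynamics model: predicts the action that moves the
    agent from state s to a target state s_target.

    A linear least-squares fit stands in for the learned network used in
    HIDIL (an illustrative simplification, not the paper's method)."""

    def __init__(self):
        self.W = None  # weight matrix mapping [s, s_target] -> action

    def fit(self, states, next_states, actions):
        # Stack each (s, s') pair into one feature vector and solve the
        # least-squares problem X @ W ~= actions.
        X = np.hstack([states, next_states])
        self.W, *_ = np.linalg.lstsq(X, actions, rcond=None)
        return self

    def act(self, state, target_state):
        # Recover the action believed to carry `state` to `target_state`.
        x = np.hstack([state, target_state])
        return x @ self.W


# Synthetic transitions with additive dynamics s' = s + a, so the true
# inverse dynamics is simply a = s' - s.
rng = np.random.default_rng(0)
states = rng.normal(size=(200, 3))
actions = rng.normal(size=(200, 3))
next_states = states + actions

policy = InverseDynamicsPolicy().fit(states, next_states, actions)
s, s_target = np.zeros(3), np.array([1.0, -0.5, 2.0])
recovered = policy.act(s, s_target)  # should be close to s_target - s
```

At deployment, the matched expert state plays the role of `target_state`: the policy is queried with the current real-environment state and the expert state it has been matched to, yielding an action adapted to the real dynamics.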


Review for NeurIPS paper: Offline Imitation Learning with a Misspecified Simulator

Neural Information Processing Systems

Summary and Contributions: The authors propose an improvement on existing approaches to imitation learning of policies for embodied agents. The approach is a hybrid between sim-to-real RL approaches (which require a simulator closely matching the real world) and real-world imitation learning approaches such as GAIL. The general idea of the paper is that there is a simulator which, however, is allowed to have different dynamics than the "real world". In particular, the assumption is that two policies can reach the same goal state from the same starting point within H steps in the real world. The algorithm is tested on OpenAI Gym environments, where both the real-world and simulator environments are simulations (with different parametrizations).


Review for NeurIPS paper: Offline Imitation Learning with a Misspecified Simulator

Neural Information Processing Systems

This paper received a wide spread of reviews and generated significant discussion amongst the reviewers. In the end, the majority of the reviewers agreed that while the main contribution was slightly lacking in novelty (in the sense that it was mostly a retargeting of a known technique to a new setting), it was still a valuable contribution. However, there was not total consensus, as R1 had significant concerns about how the paper was written. That said, the majority of reviewers consider the paper strong enough to be accepted, so I recommend acceptance, with the caveat that the authors pay close attention to R1's revision suggestions to improve the communication of ideas.
